AITopics

Country:

North America > United States > Maine (0.04)
Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States > California (0.04)
North America > Canada > Quebec > Montreal (0.04)

Genre: Research Report > Experimental Study (1.00)

Industry:

Transportation > Passenger (1.00)
Transportation > Ground > Road (1.00)
Health & Medicine (1.00)
(3 more...)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(3 more...)

Zhang, Lintong, Yin, Kang, Lee, Seong-Whan

Towards Fine-Grained Interpretability: Counterfactual Explanations for Misclassification with Saliency Partition

arXiv.org Artificial IntelligenceNov-12-2025

Attribution-based explanation techniques capture key patterns to enhance visual interpretability; however, these patterns often lack the granularity needed for insight in fine-grained tasks, particularly in cases of model misclassifica-tion, where explanations may be insufficiently detailed. T o address this limitation, we propose a fine-grained counterfactual explanation framework that generates both object-level and part-level interpretability, addressing two fundamental questions: (1) which fine-grained features contribute to model misclassification, and (2) where dominant local features influence counterfactual adjustments. Our approach yields explainable counterfactuals in a non-generative manner by quantifying similarity and weighting component contributions within regions of interest between correctly classified and misclassified samples. Furthermore, we introduce a saliency partition module grounded in Shapley value contributions, isolating features with region-specific relevance. Extensive experiments demonstrate the superiority of our approach in capturing more granular, intuitively meaningful regions, surpassing fine-grained methods.

explanation, machine learning, natural language, (20 more...)

2511.07974

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Explanation & Argumentation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

Neural Information Processing SystemsOct-10-2025, 01:22:56 GMT

Interpretable Image Classification with Adaptive Prototype-based Vision Transformers Chiyu Ma

In our model, a prototype consists of parts, which can deform over irregular geometries to create a better comparison between images.

latent feature, prototype, protovit, (16 more...)

Country:

North America > United States > Maine (0.04)
Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States > California (0.04)
North America > Canada > Quebec > Montreal (0.04)

Genre: Research Report > Experimental Study (1.00)

Industry:

Transportation > Passenger (1.00)
Transportation > Ground > Road (1.00)
Health & Medicine (1.00)
(3 more...)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Neural Information Processing SystemsOct-8-2025, 21:25:24 GMT

6ffe484a646db13891bb6435ca39d667-Supplemental-Conference.pdf

artificial intelligence, machine learning, opération, (17 more...)

Country: Asia > Middle East > Israel (0.05)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.50)

Shubhendu Trivedi, David Mcallester, Greg Shakhnarovich

Discriminative Metric Learning by Neighborhood Gerrymandering

Neural Information Processing SystemsFeb-9-2025, 17:09:39 GMT

We formulate the problem of metric learning for k nearest neighbor classification as a large margin structured prediction problem, with a latent variable representing the choice of neighbors and the task loss directly corresponding to classification error. We describe an efficient algorithm for exact loss augmented inference, and a fast gradient descent algorithm for learning in this model. The objective drives the metric to establish neighborhood boundaries that benefit the true class labels for the training points. Our approach, reminiscent of gerrymandering (redrawing of political boundaries to provide advantage to certain parties), is more direct in its handling of optimizing classification accuracy than those previously proposed. In experiments on a variety of data sets our method is shown to achieve excellent results compared to current state of the art in metric learning.

artificial intelligence, machine learning, neighbor, (15 more...)

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
Asia > Middle East > Jordan (0.04)

Industry:

Government > Regional Government > North America Government > United States Government (0.71)
Government > Voting & Elections (0.61)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Nearest Neighbor Methods (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.49)

Romero-Alvarado, Daniel, Martínez-Plumed, Fernando, Hernández-Orallo, José

What should an AI assessor optimise for?

arXiv.org Artificial IntelligenceFeb-1-2025

An AI assessor is an external, ideally indepen-dent system that predicts an indicator, e.g., a loss value, of another AI system. Assessors can lever-age information from the test results of many other AI systems and have the flexibility of be-ing trained on any loss function or scoring rule: from squared error to toxicity metrics. Here we address the question: is it always optimal to train the assessor for the target metric? Or could it be better to train for a different metric and then map predictions back to the target metric? Us-ing twenty regression and classification problems with tabular data, we experimentally explore this question for, respectively, regression losses and classification scores with monotonic and non-monotonic mappings and find that, contrary to intuition, optimising for more informative met-rics is not generally better. Surprisingly, some monotonic transformations are promising. For example, the logistic loss is useful for minimis-ing absolute or quadratic errors in regression, and the logarithmic score helps maximise quadratic or spherical scores in classification.

artificial intelligence, machine learning, natural language, (18 more...)

2502.00365

Country:

North America > United States > California (0.04)
North America > Puerto Rico (0.04)
Europe > United Kingdom (0.04)
(2 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine > Therapeutic Area (0.46)
Leisure & Entertainment > Games (0.35)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.46)

Mezentsev, Gleb, Gusak, Danil, Oseledets, Ivan, Frolov, Evgeny

Scalable Cross-Entropy Loss for Sequential Recommendations with Large Item Catalogs

arXiv.org Artificial IntelligenceNov-30-2024

Scalability issue plays a crucial role in productionizing modern recommender systems. Even lightweight architectures may suffer from high computational overload due to intermediate calculations, limiting their practicality in real-world applications. Specifically, applying full Cross-Entropy (CE) loss often yields state-of-the-art performance in terms of recommendations quality. Still, it suffers from excessive GPU memory utilization when dealing with large item catalogs. This paper introduces a novel Scalable Cross-Entropy (SCE) loss function in the sequential learning setup. It approximates the CE loss for datasets with large-size catalogs, enhancing both time efficiency and memory usage without compromising recommendations quality. Unlike traditional negative sampling methods, our approach utilizes a selective GPU-efficient computation strategy, focusing on the most informative elements of the catalog, particularly those most likely to be false positives. This is achieved by approximating the softmax distribution over a subset of the model outputs through the maximum inner product search. Experimental results on multiple datasets demonstrate the effectiveness of SCE in reducing peak memory usage by a factor of up to 100 compared to the alternatives, retaining or even exceeding their metrics values. The proposed approach also opens new perspectives for large-scale developments in different domains, such as large language models.

large language model, machine learning, natural language, (20 more...)

doi: 10.1145/3640457.3688140

2409.18721

Country:

Europe > Italy > Apulia > Bari (0.05)
Europe > Russia > Central Federal District > Moscow Oblast > Moscow (0.04)
Asia > Russia (0.04)
(2 more...)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Neural Information Processing SystemsOct-7-2024, 08:02:33 GMT

Reviews: Adversarial Risk and Robustness: General Definitions and Implications for the Uniform Distribution

Or is it for a specific mu? - Page 4: "We did not state the loss function explicitly [...]" is missing a "." at the end.

general definition and implication, robustness, uniform distribution, (9 more...)

Technology: Information Technology > Artificial Intelligence (0.31)

Neural Information Processing SystemsMar-13-2024, 12:00:49 GMT

Discriminative Metric Learning by Neighborhood Gerrymandering

inference, learning, neighbor, (14 more...)

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
Asia > Middle East > Jordan (0.04)

Industry:

Government > Regional Government > North America Government > United States Government (0.71)
Government > Voting & Elections (0.61)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Nearest Neighbor Methods (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.49)

Kotyan, Shashank, Vargas, Danilo Vasconcellos

Deep neural network loses attention to adversarial images

arXiv.org Artificial IntelligenceJun-10-2021

Adversarial algorithms have shown to be effective against neural networks for a variety of tasks. Some adversarial algorithms perturb all the pixels in the image minimally for the image classification task in image classification. In contrast, some algorithms perturb few pixels strongly. However, very little information is available regarding why these adversarial samples so diverse from each other exist. Recently, Vargas et al. showed that the existence of these adversarial samples might be due to conflicting saliency within the neural network. We test this hypothesis of conflicting saliency by analysing the Saliency Maps (SM) and Gradient-weighted Class Activation Maps (Grad-CAM) of original and few different types of adversarial samples. We also analyse how different adversarial samples distort the attention of the neural network compared to original samples. We show that in the case of Pixel Attack, perturbed pixels either calls the network attention to themselves or divert the attention from them. Simultaneously, the Projected Gradient Descent Attack perturbs pixels so that intermediate layers inside the neural network lose attention for the correct class. We also show that both attacks affect the saliency map and activation maps differently. Thus, shedding light on why some defences successful against some attacks remain vulnerable against other attacks. We hope that this analysis will improve understanding of the existence and the effect of adversarial samples and enable the community to develop more robust neural networks.

class activation map, gradient-weighted class activation map, neural network, (10 more...)

2106.05657

Country: Asia > Japan > Kyūshū & Okinawa > Kyūshū (0.04)

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.64)